Web Adoption: An Attempt Toward Classifying Risky Internet Web Browsing Behavior
نویسندگان
چکیده
• Background. This paper explores associations of computer compromise events in relationship to web browsing activity over a population of computers. • Aim. Our hypothesis was that computers are more likely to be compromised in comparison to other computers when the computer regularly browses to web sites prior to other computers visiting the same site (early adopters) or browses to unique web sites that no other computer visited (unique adopters) in a given time period. • Method. Web proxy data and associated computerspecific compromise events covering 24,000+ computers in a contiguous 6 month time period were used to group computers in various adopter categories and compare potential compromise events between the groups. • Results. We found distinction in web surfing behavior, in some cases differentiating the chance of compromise from 2.5-fold to over 418-fold between certain adopter categories. However, the study also showed no additional value in predicting compromise using these more complex adopter categories when compared to using simple unique web activity counts. As additional contributions, we have characterized several large, real-work cyber defense relevant data sets and introduced a method for simplifying web URLS (client web requests) that reduces unwanted uniqueness from dynamic content while preserving key characteristics. • Conclusions. We found that a count of unique web visits over time has the same level of predictive power for potential compromise as does the more complicated web adopter model. Both models have better than chance levels of prediction but also reinforce the idea that many factors beyond elements of web browsing activity are associated with computer compromise events. Nonetheless, our adopter model may still have value in objective computer risk determination based on web browsing behavior.
منابع مشابه
Extracting user web browsing patterns from non-content network traces: The online advertising case study
Online advertising is a rapidly growing industry currently dominated by the search engine ’giant’ Google. In an attempt to tap into this huge market, Internet Service Providers (ISPs) started deploying deep packet inspection techniques to track and collect user browsing behavior. However, these providers have the fear that such techniques violate wiretap laws that explicitly prevent interceptin...
متن کاملIgnorance is Bliss: The Effect of Increased Knowledge on Privacy Concerns and Internet Shopping Site Personalization Preferences
Studies have shown that people claim that privacy matters to them but then they often do things while browsing that are risky in terms of privacy. The seeming inconsistency between professed privacy concerns and risky behavior on the Internet may be more a consequence of ignorance rather than irrationality. It is possible that many people simply don’t understand the technologies, risks, and reg...
متن کاملDesigning a System for Trend Analysis of Users in Website Surfing in Iran Using Data Mining and Text Mining Algorithms
Background and Aim: As of the entrance of web surfing to the lifestyle of a vast majority of people in the society and the need for a more accurate social and cultural policy making in the field, authors intended to analyze the behavior of the society users in viewing different websites so as to help politicians and practitioners. Methods: Design science research method is used in this research...
متن کاملA Graph-Based Web Usage Mining Considering Page Browsing Time
With the increase of large web sites which have complex link structures, web access logs have caught attention as a clue for web site administrators to understand user’s needs and demands. While conventional statistical analysis is used for most of the cases, web usage mining is an emerging attempt to apply data-mining based technique to web access log analyses. However, statistical and data-mi...
متن کاملPredictive Analysis of Users Behaviour in Web Browsing and Pattern Discovery Networks
Web-Page prediction is a classification problem in which we attempt to predict the next set of Web pages that a user may visit based on the knowledge of the previously visited pages. To achieve this we depend on the web access log files which are recorded in the server. These web access log files can be mined to extract interesting pattern so that the user behavior can be understood. Predicting...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013